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Abstract 

Exact maximum clique finders have progressed to the point 
where we can investigate cliques in million-node social and 
information networks, as well as find strongly connected 
components in temporal networks. We use one such finder 
to study a large collection of modern networks emanating 
from biological, social, and technological domains. We 
show inter-relationships between maximum cliques and 
several other common network properties, including network 
density, maximum core, and number of triangles. In 
temporal networks, we find that the largest temporal strong 
components have around 20-30% of the vertices of the entire 
network. These components represent groups of highly 
communicative individuals. In addition, we discuss and 
improve the performance and utility of the maximum clique 
finder itself. 

Keywords max-clique, temporal strong components, 
social networks, information networks 

1 Introduction 

The max-clique problem is simply stated: 

Consider a graph G. A clique of size k in G 
is a subset of k vertices that forms a complete 
subgraph. The maximum clique problem is to 
find the largest k such that there is a clique of 
size k in G. 

Determining if a graph has a clique of size k is well 
known to be NP-complete. Approximating the largest 
clique is also known to be hard [18]. Clique is even hard 
for networks with power-law degree distributions and 
large clustering coefficients [1]. Clearly, MAX-CLIQUE is 
not, generally, fast. Yet, many real- world problems have 
features that do not elicit worst-cast behaviors from 
well-designed algorithms. In this manuscript, we use 
a state-of-the-art exact maximum clique finder [30] to 
investigate cliques in social and information networks. 1 
We also use a relationship between strongly connected 
components in temporal networks and maximum cliques 
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1 We use the terms graph and network interchangeably through- 
out the manuscript. Also, vertex and node are interchangeable 
too. 



in a reachability graph to find these temporal strong 
components [5, 25]. 

We have three goals with this manuscript. 

The first is to learn about the structure of social and 
information networks by comparing maximum clique 
size to other common network properties. We use 38 
networks for this study, obtained from various domains, 
that vary in size from a few hundred edges to 100 million 
edges. The results are presented in Section 2. We find 
interesting relationships between the average degree and 
maximum clique for network of Facebook friendships. 
Also, we study various upper-bounds on the maximum 
clique size for these networks. Given a connected graph 
G, it can be shown that the clique number co(G) — the 
size of the largest clique in G — obeys the following chain 
of bounds 

5(G) < u(G) - 1 < K(G) < A(G) 

where K(G) is the largest k for which there is a k-core in 
G (a k-core in G is a maximal induced subgraph of G in 
which every node has at least k neighbors), 6(G) is the 
minimum degree in G, and A(G) is the maximum degree 
in G. Further, for a connected graph containing at least 
one triangle, we can also show that w(G) < y/2 ■ T(G), 
where T(G) is the maximum number of triangles incident 
on any node in G. We find that, for most real-world 
networks, K(G) is typically within a factor of five of 
ui(G) whereas the ratio between y/2 ■ T(G) and uj(G) 
can be fairly large. 

Inspired by [25, 5], our second goal is to compute the 
largest temporal strong components. When each edge 
represents a contact — a phone call, an email, or physical 
proximity — between two entities at a specific point in 
time, we have a particular type of evolving network 
structure [12]. A temporal path in such a network 
represents a sequence of contacts that obeys time. A 
temporal strong component is a set of vertices where 
all temporal paths exist. See Section 3 for a formal 
treatment. Just like clique, checking if an evolving 
network has a temporal strong component of size k is NP- 
complete. Yet, temporal strong components correspond 
to cliques in a temporal reachability graph where each 
edge signifies the existence of a temporal path between 



vertices. Thus, finding maximum cliques enables us to 
find temporal strong components. We investigate the size 
and structure of these cliques on phone call networks 
and Twitter retweet networks in Section 3. Perhaps 
surprisingly, the largest temporal strong components 
typically involve only a small fraction (20% to 30%) of 
the total vertices. 

For a small portion of these experiments, where the 
networks are relatively dense, we find that the exact 
maximum clique method requires unacceptable time. 
In Section 4, we discuss extensions we made to the 
algorithm from [30] to reduce its runtime and increase 
its flexibility; in particular, we parallelize the clique- 
searches and use better upper-and-lower bounds on the 
maximum clique size to prune the search space further 
and allow additional flexibility. 

We make all the software underlying this manuscript 
available in the spirit of reproducible research. See: 

http : / /www . cs . pur due . edu/ -dgleich/ codes/ cliques 

2 Maximum Cliques in Social and Information 
Networks 

We begin by studying the properties of maximum cliques 
on modern networks. See Section 4 for a discussion of 
the maximum clique procedure we utilize in this section. 

2.1 Data. For all of these experiments, we discard 
any edge weights, self-loops, and only consider the 
largest strongly connected component. In contrast 
to the temporal components we describe in the next 
section, in this section we mean the standard strong 
components. If the graph is directed, we then remove 
non-reciprocated edges. We present a complete summary 
of some basic network statistics for the following datasets 
in the supplementary material. In total, we consider 8 
broad classes. 

1. Biological networks. We study a network 
where the nodes are proteins and the edges represent 
protein-protein interactions (yeast [17]). We also study 
the celegans metabolic network [10] where the nodes 
are substrates and the edges are metabolic reactions. 
Cliques in these networks represent biologically relevant 
modules. 

2. Interaction networks. Here, nodes repre- 
sent individuals and edges represent interaction in the 
form of (i) email exchanges (email-enron [21], email- 
enron-employ[8]), (ii) message posts (wiki-Talk [19], fb- 
messages [27]), (iii) cellphone calls (reality [11]), or (iv) 
face-to- face contacts (infect-dublin, infect-hyper [16]). 

3. Social networks. Nodes are again people, 
and edges represent social relationships in terms of (i) 
importance (wiki-Vote [19]), (ii) trust (epinions [31]), 
(iii) friendship or follower (orkut [24], youtube [23], soc- 



Live Journal [4], flickr [14], brightkite [7], gowalla [7], 
slashdot [21]). 

4. Facebook networks. The nodes are people and 
edges represent "Facebook friendships" (fb-Stanford, fb- 
Penn94, fb-Berkeley, fb-Duke, fb-Texas84 [35]). 

5. Technological networks. The nodes in these 
networks are either agents (p2p-Gnutella [22]) or routers 
(Internet- AS [15], as-Skitter [6], WHOIS [37], routers- 
rf[33]), and edges are observed communications between 
the entities. 

6. Collaboration networks. These are networks 
in which nodes represent individuals and edges represent 
scientific collaborations (MathSciNet [28], ca-CondMat, 
ca-AstroPh, ca-HepPh [20]). Large cliques in these 
networks are expected because they are formed when 
papers have many authors. Thus, cliques are only 
interesting if they are larger than the author list of 
any particular paper. 

7. Web link networks. Here, nodes are web-pages 
and edges are hyperlinks between pages (polblogs [2], 
wikipedia [13], web-Google [21]). The largest clique 
represents the largest set of pages where the author 
wishes users to freely move around. 

8. Retweet networks. Finally, in retweet net- 
works, nodes are Twitter users and edges represent 
whether the users have retweeted each other (political- 
retweets [9], twitter-copen [3]). We generated twitter- 
crawl ourselves, see the temporal network datasets for 
more information. Cliques are groups of users that have 
all mutually retweeted each other, and may represent an 
interest cartel. 

2.2 Analysis. The time taken for the exact clique 
finder, a clique heuristic, the size of the maximum clique, 
and the size of the heuristic clique are shown in Table 1 . 
The identified cliques range in size from 3 nodes to 239 
nodes. In all but one case the clique finder from [30] 
identified the largest clique. Even when it failed to 
find the largest clique on the flickr network, we have an 
upper-bound from the max fc-core that the clique is no 
larger than 309 vertices. In general, the method worked 
quickly, however, on the 106M edge orkut social network, 
it took almost 10 hours to find it. 

There are some interesting surprises in these clique 
numbers. First, the technological networks have large 
cliques with 50 and 60 vertices. We did not expect 
to find such large cliques in these networks as cliques 
ought to represent redundant edges that could be more 
efficiently handled by alternative topologies. Note that 
this finding may represent a by-product of the data 
collection methodology, rather than a property of the 
network. Second, the largest clique in a social network 
or Facebook network has 51 members. These are fairly 



Table 1: Max Cliques in Real-world Networks. 
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0.04 
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CELEGANS 
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0.04 


0.04 


MATHSCINET 
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25 
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0.05 
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0.06 
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26K 
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16 
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1.9M 


16 
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70K 


358K 


26 


23 
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37 
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3.1M 
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TEXAS 
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large groups for everyone to know each other. Third, and 
finally, the collaboration networks had large cliques, as 
we expect. We suspect that the clique with 239 members 
represents a single paper. 

Now, we wish to study the relationships between 
these clique sizes and various graph properties. A full 
table of these results is given in the supplementary 
material. Here, we summarize with more informative 
figures. We show how the size of the clique varies with 
the global clustering coefficient (ratio of triangles to 
wedges), the average degree, and average triangles per 
node in Figure 1. In the clustering coefficient plot (a), we 
see little relationship between the clustering coefficient 
and the maximum clique, whereas we see a much strong 
relationship between the average number of triangles and 
clique size (c). In the average degree plot (b), we also 
see the set of Facebook networks and the orkut network 
form a small group. Among these figures, the strongest 
relationship with the clique size is the log of the average 
triangle count. 

To conclude our analysis of these networks, we study 



the bounds on the size of the largest clique from the 
introduction: 

lo{G) - 1 < K(G) and u(G) - 1 < a/2T(G). 

These bounds are shown in Figure 2. We find that the 
fc-core bound is accurate up to a factor of 5, and usually 
much more accurate. In the supplementary material, 
we further include the fraction of the maximum clique 
contained with the highest fc-core. This value is usually 
high, except for the largest networks. This finding 
suggests that as graphs increase in size, the relationship 
between the fc-core and maximum clique changes. 

3 Temporal Strong Components 

In this section, we use the maximum clique method as a 
subroutine to compute temporal strong components [25] . 
Since this area is somewhat new, we review some of the 
basic definitions in order to motivate the relationship 
between cliques and temporal strong components. 

Definition 3.1. (temporal network) Let V be a set 

of vertices, and Et C V x V x R + be the set of temporal 
edges between vertices in V. Each edge (u,v,t) has a 
unique time t G R + . 

For such a temporal network, a path represents a 
sequence of edges that must be traversed in increasing 
order of edge times. That is, if each edge represents a 
contact between two entities, then a path represents a 
feasible route for a piece of information. 

Definition 3.2. (temporal paths) A temporal path 
from u to w in G — (V, Et) is a sequence of 
edges e\, . . . , such that the e\ = (u, v±, t\), . . . , e& = 
(uk,w,tk) where Vj = itj+i and tj < tj+\ for all j = 1 
to fc. We say that u is temporally connected to w if there 
exists a such temporal path. 

This definition echoes the standard definition of a path, 
but adds the additional constraint that paths must follow 
the directionality of time. Temporal paths are inherently 
asymmetric because of the directionality of time. 

Definition 3.3. (temporal strong component) 

Two vertices (u, w) are strongly connected if there 
exists a temporal path V from u to w and from w to 
u. A temporal strongly connected component (tSCC) is 
defined as a maximal set of vertices C C V such that 
any pair of vertices in C are strongly connected. 

While the vertices of a strong component in a graph 
define an equivalency class, and hence, we can partition 
a network into components, the same fact is not true of 
temporal strong components. The vertices in a temporal 
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Figure 1: Each point shows one of the 38 graphs, colored by the graph type. See the discussion in the text. 



strong component can overlap with those in another 
temporal strong component. 

Note that a temporal weak component is always 
equal to the connected component in the static graph [34] . 
We conclude by mentioning that stronger definitions of 
temporal components exist. For example, the temporal 
paths used to define a strong component can be further 
restricted to only use vertices from the component C 
itself. 

As previously mentioned, checking if a graph has a k- 
node temporal SCC is NP-complete [5, 25]. Nonetheless, 
we can compute the largest such strong component using 
a maximum clique algorithm. Let us briefly explain how. 

The first step is to transform the temporal graph 
into what is called a strong-reachability graph. For each 
pair of vertices in V, we place an edge in the strong 
reachability graph if there is a temporal path between 
them. In Algorithm 1, this corresponds to the REACH 
subroutine. That method uses the temporal ordering 
proposed by [29] and builds up temporal paths backwards 
in time. With this reachability graph, the second step of 
the computation is to remove any non-reciprocated edges 
and then find a maximum clique. That maximum clique 
is the largest set of nodes where all pairwise temporal 
paths exist, and hence, is the largest temporal strong 
component [25]. 

3.1 Temporal Graph Data. We use three types of 
temporal graph data. For all networks, we discard self- 
loops and any edge weights. In all cases, the nodes 
represent people. And in all cases, the largest temporal 
strong components reflect groups of people that meet, 
interact, or retweet with each other sufficiently often to 



Algorithm 1 Largest Temporal Strong Component 





Input: Dynamic Graph Q = (V,Et) 


1 


procedure max-tscc(£ = (V, E T )) 


2 


E R = reach(£) 


3 


Remove non-reciprocal edges from Er 


4 


Compute the max-clique in the graph (V, E R ) 


5 


Return the subgraph of G induced by C 


6 


procedure reach(£ = (V,E T )) 


7 


Sort edges to be in reverse time order 


8 


Set Er to be the set of all self-loops 


9 


for (i,j,t) e E do 


10 


Add (i, k) to En for all k where (j, k) € Er 


11 


Return Er 



transmit any message or meme. 

Contact networks. The edges are face-to- face 
contacts (infect-dublin, infect-hyper[16]). See ref. [32] 
for more details about these data. 

Interaction networks. The edges are observed 
interactions such as forum posts (fb- forum [26]) and 
private messages (fb- messages [27]). We also investigate 
a cellular telephone call network where the edges are 
calls (reality [11]). 

Retweet networks. Here, the edges are retweets. 
First, we investigate 26 retweet networks from various 
social and political hashtags. We collected these from 
Truthy [36] on September 20th, 2012. Of these, only a 
few had non-trivial temporal strong components which 
we discuss below. Also, we analyzed a network of 
political tweets centered around the November 2010 
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Figure 2: For a subset of 23 networks, sorted by increasing values of maximum degree over clique size, we show the 
relationship between the fc-core upper bound on the clique size (shown as the blue points), the maximum triangle 
upper-bound on clique size (shown as the green points), and the maximum degree upper-bound on clique size 
(shown as the red points). We normalize these by the size of the maximum clique so all values are larger than one. 
Thus, we can immediately determine that the fc-core bound is accurate up to a factor of 5, and usually much more 
accurate. The upper number is the ratio of maximum degree to average degree - a measure of the skewness of the 
degree distribution. 



election in the US (retweet [9]). The final dataset is a 
retweet and mentions network from the UN conference 
held in Copenhagen. The data was collected over a two 
week period (twitter-copen [3]). 

3.2 Results and analysis We now discuss the re- 
sults of running our temporal strong component algo- 
rithms. Figures 3 and 4 show the reachability and largest 
temporal strong component from two networks, respec- 
tively. For the both networks, the exact clique finder did 
not complete within the specified period. We mark that 
fact with an "*" to denote that it may not be the largest. 
The heuristic procedure found a clique with 1217 vertices 
for the reality network and 166 vertices for the retweets 
network. Given that the largest cores in these networks 
are 1236 and 175, respectively, in the worst case, we are 
only off by a few vertices. Our finding about the political 
party for the retweets also holds for the 166-core of the 
network, so we can be confident there is not a larger 
temporal strong component on the "political left." 

We summarize the remaining experiments on the 
temporal strong components in Tables 2 and 3. The first 
table presents basic network statistics of the original 
temporal graphs (G) as well as their strong reachability 
graphs (R a ). For both cases, we list the largest clique we 
computed with either the exact or heuristic clique finder. 
Like above, those cliques marked with a "*" represent 
heuristic computations because we only let the exact 
clique algorithm run for one hour. However, even for 



these heuristics, we find that they are strikingly close to 
the upper-bound on the clique size given by the A:-core. 
In fact, in all of the other examples, the largest clique 
in these networks was the largest fc-core. Thus, just as 
in the political retweet network above, we are confident 
that our results are close to the true largest component. 

The second table (Table 3) shows the time required 
for the maximum clique routine to return the largest 
temporal strong component - the largest clique in 
the reachability graph. As mentioned previously, we 
terminated the exact clique algorithm after an hour. In 
this table, we also include statistics on the largest strong 
component for the hashtag networks from Truthy. These 
show that there are heavily interacting groups of tweeters 
in these datasets that we can identify quickly (less than 
a second) using our methods. See the supplementary 
material for the network statistics on these hashtag 
networks. 

Overall, we find a few interesting properties of 
these temporal strong components. In the contact 
networks (infect-hyper and infect-dublin) , both of the 
largest strong components had about 100 vertices, 
despite the drastically different sizes of the initial 
dataset. We suspect this is a property of the data 
collection methodology because the infect-dublin data 
were collected over months, instead of days for the 
infect-hyper. In the interaction networks, the temporal 
components contain a significant fraction of the total 
vertices, roughly 20-30%. Finally, in the retweet 



(a) Reality 



(b) Retweets (political) 



Figure 3: In order to compute the largest temporal strong component, we first compute the strong reachability 
network. These networks are rather dense and may reveal clear community structure as in the retweet networks 
(and, the infect-dublin network, see the supplementary material). 




(a) Reality (b) Retweets (political) 



Figure 4: (a) In the largest* (see discussion) temporal strong component of the reality network, we see a small 
group of core users maintaining connectivity among various groups, (b) whereas the largest* component in the 
retweet network consists of 166 twitter users classified as politically "right" according to the original data with 
only a single exception. These users are a subset of one of the communities. 



Table 3: For each of the temporal networks, we list the 
largest clique found by the exact method (if it finished 
in an hour) or via the heuristic, as well as the time taken 
by each algorithm. The exact method completes quickly 
for the contact networks, and hashtag networks, but 
has trouble with the interaction networks and retweet 
networks. In these cases, the heuristic gives a good 
lower-bound, that isn't too far away from the fc-core 
upper-bound. 
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networks, these temporal strong components are around 
20% of the total vertices in the reachability graph. Given 
the strong communication pattern between these groups, 
they are good candidates for the center of a community 
in these networks. 

4 Maximum Clique Algorithms 

We now provide some algorithmic details of the maxi- 
mum clique finders we have used in the previous two 
sections. We first outline the algorithms as introduced 
in [30] and then discuss the extensions we made in this 
work to improve performance and utility. 

4.1 The Exact Algorithm Given an undirected 
graph G = (V, E), let C v denote a clique of the largest 
size containing the vertex v. A maximum clique in 
G can be found by computing C v for every vertex 
v in V and then picking the largest among these. 
The exact algorithm we use follows this basic strategy, 
but drastically speeds up the process by avoiding the 
computation of every C u , u € V, that would eventually 
be smaller than the global maximum. For such pruning 



purposes, the algorithm maintains the size of the largest 
clique found so far (MaxSoFar). 

For a given vertex v, C v is evaluated by performing 
a depth-first search starting from v. At each depth of 
the search, the algorithm compares MaxSoFar against 
the number of remaining vertices that could potentially 
constitute a clique containing the vertex v. If the latter 
number is found to be smaller, the algorithm backtracks 
(the search is pruned). The algorithm employs several 
other pruning steps in addition to the backtracks. For 
example, a depth-first search from a vertex v is invoked 
only if the degree of v is greater than or equal to 
MaxSoFar, since otherwise the largest clique containing 
v would certainly be smaller than MaxSoFar. 

4.2 The Heuristic In its search for a maximum 
clique in the entire graph, the exact algorithm described 
explores for every vertex v feasible cliques along a subset 
of the edges incident on v. The heuristic instead picks 
just one of these edges for further exploration. The 
picked edge is one incident on a vertex with the largest 
degree among the neighbors of v. The rationale for this 
choice is that the neighbor of v with the largest degree 
is more likely than other neighbors of v to be part of 
the largest clique on v (C v ). The exact algorithm in the 
worst case has exponential runtime, whereas the runtime 
of the heuristic is bounded by 0(\V\ ■ A 2 ), where A is 
the maximum degree in the input graph. 

4.3 New Extensions In this work, we extended both 
the exact algorithm and the heuristic in two major ways. 

First, we parametrized the exact algorithm such that 
it takes both a lower bound lb and an upper bound ub 
on the size of the clique to be computed. The algorithm 
starts by setting MaxSoFar to lb — thus a good lower 
bound helps speed up the algorithm by allowing early 
pruning. And the algorithm stops the search as soon as 
a clique of size equal to ub is found — in the absence of a 
bound, the algorithm may continue to iterate over the 
remaining vertices even if the global maximum clique 
had already been found. 

Second, we parallelized the algorithms. The compu- 
tations of per- vertex maximum cliques (CVs) over the 
vertices are independent of each other — and hence can be 
done in parallel — except for one issue: the need for infor- 
mation on the size of the global maximum clique found 
so far. However, since this information is needed only for 
pruning purposes, one need not insist on disseminating 
it to all processing units as soon as it is updated. In- 
stead, one could work with a locally computed maximum 
clique for pruning purposes and occasionally exchange 
the global value at regular synchronization points. Fol- 
lowing this strategy, we parallelized, using OpenMP 



Table 2: Network statistics of the dynamic networks and their strong reachability graphs. The cases where the 
max k-core gives the exact size of the max clique are highlighted in gray. Here, (i max and d avg are the largest and 
average degrees, respectively. Also, R is the mean clustering coefficient; T and T avg are the maximum number and 
average number of triangles incident on a vertex; K is the largest degree for a fc-core to exist, and lo is the clique 
number (or a lower-bound from the heuristic if the value has an *). 
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threading, both the exact algorithm and the heuristic. 

The first improvement, parameterization, makes 
possible added flexibility. By intentionally setting ub 
to be smaller than a true upper bound on a maximum 
clique, a user may force the algorithm to terminate 
when a clique of the desired size (ub) is found. Figure 5 
illustrates the benefit of this. It shows how the runtime 
of the algorithm varies as the ub is varied for three large 
networks from our dataset (as-skitter, texas, and 
whois). Note that, in general, the algorithm detects 
cliques of small sizes extremely fast: in fact cliques of up 
to 90% of the maximum clique size can be computed at 
under 5% of the time it takes to compute the maximum 
clique. The plots also provide some insight into the 
different scenarios the three networks represent: in the 
as-skitter network, computing a clique of one size 
takes nearly the same time as computing a clique of just 
about any other size — meaning that cliques are easy to 
find — whereas in the other two networks it is increasingly 
difficult to detect larger cliques. 

5 Discussion 

In the title, we hypothetically asked "What if clique 
were fast?" We found that a state-of-the-art clique 
finder can identify maximum cliques in sparse networks 
with hundreds of millions of edges in a few seconds, 
but in the worst case, a few hours. These results show 
that cliques in social and information networks are not 
always difficult to find and hint at a deeper structural 
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Figure 5: Runtime behavior as upper bound on the 
clique algorithm is varied. 

property of these networks. The same maximum clique 
finder was less successful at identifying maximum cliques 
in temporal reachability networks (which give rise to 
temporal strong components). These networks have 
significant structure implied by the temporal edges 
that are hidden from the clique finder. This structure 
and the density of these networks may make these 
problems especially challenging. In the future, we plan 
to investigate additions to the clique finder to exploit 
such structure. 
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Appendix 



Table 4: Network statistics of the static graphs from section 2, including the size of the maximum clique and 
corresponding upperbounds. Here, d max and d avg are the largest and average degrees, respectively. Also, R is 
the mean clustering coefficient; T and T avg are the maximum number and average number of triangles incident 
on a vertex; K is the largest degree for a fc-core to exist, uj is the clique number, and Tx = ^j9p * s ^ ne recall 
measurement by using the k-core to find the maximum clique. 
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Table 5: Network statistics of the dynamic retweet graphs and their strong reachability graphs from section 
3. 
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(a) Reachability Graph (b) Largest Strong Temporal Component 



Figure 6: Infectious Contact Network (Dublin) from Section 3 



